Overview

Dataset statistics

Number of variables26
Number of observations205
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory154.4 KiB
Average record size in memory771.2 B

Variable types

NUM16
CAT10

Reproduction

Analysis started2020-06-11 04:09:31.632681
Analysis finished2020-06-11 04:10:14.322055
Duration42.69 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

highway-mpg is highly correlated with city-mpgHigh correlation
city-mpg is highly correlated with highway-mpgHigh correlation
fuel-system is highly correlated with fuel-typeHigh correlation
fuel-type is highly correlated with fuel-systemHigh correlation
symboling has 67 (32.7%) zeros Zeros

Variables

symboling
Real number (ℝ)

ZEROS

Distinct count6
Unique (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.8341463414634146
Minimum-2
Maximum3
Zeros67
Zeros (%)32.7%
Memory size1.7 KiB

Quantile statistics

Minimum-2
5-th percentile-1
Q10
median1
Q32
95-th percentile3
Maximum3
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.245306828
Coefficient of variation (CV)1.492911695
Kurtosis-0.6762713562
Mean0.8341463415
Median Absolute Deviation (MAD)1
Skewness0.2110722721
Sum171
Variance1.550789096
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
06732.7%
 
15426.3%
 
23215.6%
 
32713.2%
 
-12210.7%
 
-231.5%
 
ValueCountFrequency (%) 
-231.5%
 
-12210.7%
 
06732.7%
 
15426.3%
 
23215.6%
 
32713.2%
 
ValueCountFrequency (%) 
32713.2%
 
23215.6%
 
15426.3%
 
06732.7%
 
-12210.7%
 
-231.5%
 

normalized-losses
Real number (ℝ≥0)

Distinct count51
Unique (%)24.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean120.6
Minimum65
Maximum256
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum65
5-th percentile77.2
Q1101
median115
Q3137
95-th percentile182.4
Maximum256
Range191
Interquartile range (IQR)36

Descriptive statistics

Standard deviation31.80510503
Coefficient of variation (CV)0.2637239223
Kurtosis1.499388343
Mean120.6
Median Absolute Deviation (MAD)19
Skewness0.9761135439
Sum24723
Variance1011.564706
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1154421.5%
 
161115.4%
 
9183.9%
 
15073.4%
 
10462.9%
 
12862.9%
 
13462.9%
 
9552.4%
 
16852.4%
 
6552.4%
 
10352.4%
 
9452.4%
 
7452.4%
 
8552.4%
 
10252.4%
 
9342.0%
 
10642.0%
 
11842.0%
 
12242.0%
 
14842.0%
 
13731.5%
 
8331.5%
 
10131.5%
 
12531.5%
 
15431.5%
 
Other values (26)4220.5%
 
ValueCountFrequency (%) 
6552.4%
 
7452.4%
 
7710.5%
 
7810.5%
 
8121.0%
 
8331.5%
 
8552.4%
 
8721.0%
 
8921.0%
 
9010.5%
 
ValueCountFrequency (%) 
25610.5%
 
23110.5%
 
19721.0%
 
19421.0%
 
19221.0%
 
18821.0%
 
18610.5%
 
16852.4%
 
16421.0%
 
161115.4%
 

make
Categorical

Distinct count22
Unique (%)10.7%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
toyota
32
nissan
 
18
mazda
 
17
mitsubishi
 
13
honda
 
13
Other values (17)
112
ValueCountFrequency (%) 
toyota3215.6%
 
nissan188.8%
 
mazda178.3%
 
mitsubishi136.3%
 
honda136.3%
 
volkswagen125.9%
 
subaru125.9%
 
peugot115.4%
 
volvo115.4%
 
dodge94.4%
 
bmw83.9%
 
mercedes-benz83.9%
 
plymouth73.4%
 
audi73.4%
 
saab62.9%
 
porsche52.4%
 
isuzu42.0%
 
alfa-romero31.5%
 
jaguar31.5%
 
chevrolet31.5%
 
renault21.0%
 
mercury10.5%
 

Length

Max length13
Median length6
Mean length6.47804878
Min length3

Overview of Unicode Properties

Unique unicode characters25
Unique unicode categories (?)2
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a15411.6%
 
o15211.4%
 
s1098.2%
 
t1007.5%
 
e816.1%
 
u765.7%
 
n715.3%
 
i685.1%
 
d634.7%
 
m574.3%
 
b473.5%
 
r413.1%
 
h413.1%
 
y403.0%
 
l382.9%
 
v372.8%
 
g352.6%
 
z292.2%
 
p231.7%
 
w201.5%
 
c171.3%
 
k120.9%
 
-110.8%
 
f30.2%
 
j30.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter131799.2%
 
Dash Punctuation110.8%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a15411.7%
 
o15211.5%
 
s1098.3%
 
t1007.6%
 
e816.2%
 
u765.8%
 
n715.4%
 
i685.2%
 
d634.8%
 
m574.3%
 
b473.6%
 
r413.1%
 
h413.1%
 
y403.0%
 
l382.9%
 
v372.8%
 
g352.7%
 
z292.2%
 
p231.7%
 
w201.5%
 
c171.3%
 
k120.9%
 
f30.2%
 
j30.2%
 

Most frequent Dash Punctuation characters

ValueCountFrequency (%) 
-11100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin131799.2%
 
Common110.8%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a15411.7%
 
o15211.5%
 
s1098.3%
 
t1007.6%
 
e816.2%
 
u765.8%
 
n715.4%
 
i685.2%
 
d634.8%
 
m574.3%
 
b473.6%
 
r413.1%
 
h413.1%
 
y403.0%
 
l382.9%
 
v372.8%
 
g352.7%
 
z292.2%
 
p231.7%
 
w201.5%
 
c171.3%
 
k120.9%
 
f30.2%
 
j30.2%
 

Most frequent Common characters

ValueCountFrequency (%) 
-11100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1328100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a15411.6%
 
o15211.4%
 
s1098.2%
 
t1007.5%
 
e816.1%
 
u765.7%
 
n715.3%
 
i685.1%
 
d634.7%
 
m574.3%
 
b473.5%
 
r413.1%
 
h413.1%
 
y403.0%
 
l382.9%
 
v372.8%
 
g352.6%
 
z292.2%
 
p231.7%
 
w201.5%
 
c171.3%
 
k120.9%
 
-110.8%
 
f30.2%
 
j30.2%
 

fuel-type
Categorical

HIGH CORRELATION

Distinct count2
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
gas
185
diesel
 
20
ValueCountFrequency (%) 
gas18590.2%
 
diesel209.8%
 

Length

Max length6
Median length3
Mean length3.292682927
Min length3

Overview of Unicode Properties

Unique unicode characters7
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
s20530.4%
 
g18527.4%
 
a18527.4%
 
e405.9%
 
d203.0%
 
i203.0%
 
l203.0%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter675100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
s20530.4%
 
g18527.4%
 
a18527.4%
 
e405.9%
 
d203.0%
 
i203.0%
 
l203.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin675100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
s20530.4%
 
g18527.4%
 
a18527.4%
 
e405.9%
 
d203.0%
 
i203.0%
 
l203.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII675100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
s20530.4%
 
g18527.4%
 
a18527.4%
 
e405.9%
 
d203.0%
 
i203.0%
 
l203.0%
 

aspiration
Categorical

Distinct count2
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
std
168
turbo
37
ValueCountFrequency (%) 
std16882.0%
 
turbo3718.0%
 

Length

Max length5
Median length3
Mean length3.36097561
Min length3

Overview of Unicode Properties

Unique unicode characters7
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
t20529.8%
 
s16824.4%
 
d16824.4%
 
u375.4%
 
r375.4%
 
b375.4%
 
o375.4%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter689100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
t20529.8%
 
s16824.4%
 
d16824.4%
 
u375.4%
 
r375.4%
 
b375.4%
 
o375.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin689100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
t20529.8%
 
s16824.4%
 
d16824.4%
 
u375.4%
 
r375.4%
 
b375.4%
 
o375.4%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII689100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
t20529.8%
 
s16824.4%
 
d16824.4%
 
u375.4%
 
r375.4%
 
b375.4%
 
o375.4%
 

num-of-doors
Categorical

Distinct count2
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
four
116
two
89
ValueCountFrequency (%) 
four11656.6%
 
two8943.4%
 

Length

Max length4
Median length4
Mean length3.565853659
Min length3

Overview of Unicode Properties

Unique unicode characters6
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o20528.0%
 
f11615.9%
 
u11615.9%
 
r11615.9%
 
t8912.2%
 
w8912.2%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter731100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o20528.0%
 
f11615.9%
 
u11615.9%
 
r11615.9%
 
t8912.2%
 
w8912.2%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin731100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o20528.0%
 
f11615.9%
 
u11615.9%
 
r11615.9%
 
t8912.2%
 
w8912.2%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII731100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o20528.0%
 
f11615.9%
 
u11615.9%
 
r11615.9%
 
t8912.2%
 
w8912.2%
 

body-style
Categorical

Distinct count5
Unique (%)2.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
sedan
96
hatchback
70
wagon
25
hardtop
 
8
convertible
 
6
ValueCountFrequency (%) 
sedan9646.8%
 
hatchback7034.1%
 
wagon2512.2%
 
hardtop83.9%
 
convertible62.9%
 

Length

Max length11
Median length5
Mean length6.619512195
Min length5

Overview of Unicode Properties

Unique unicode characters18
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
a26919.8%
 
h14810.9%
 
c14610.8%
 
n1279.4%
 
e1088.0%
 
d1047.7%
 
s967.1%
 
t846.2%
 
b765.6%
 
k705.2%
 
o392.9%
 
w251.8%
 
g251.8%
 
r141.0%
 
p80.6%
 
v60.4%
 
i60.4%
 
l60.4%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter1357100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
a26919.8%
 
h14810.9%
 
c14610.8%
 
n1279.4%
 
e1088.0%
 
d1047.7%
 
s967.1%
 
t846.2%
 
b765.6%
 
k705.2%
 
o392.9%
 
w251.8%
 
g251.8%
 
r141.0%
 
p80.6%
 
v60.4%
 
i60.4%
 
l60.4%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1357100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
a26919.8%
 
h14810.9%
 
c14610.8%
 
n1279.4%
 
e1088.0%
 
d1047.7%
 
s967.1%
 
t846.2%
 
b765.6%
 
k705.2%
 
o392.9%
 
w251.8%
 
g251.8%
 
r141.0%
 
p80.6%
 
v60.4%
 
i60.4%
 
l60.4%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1357100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
a26919.8%
 
h14810.9%
 
c14610.8%
 
n1279.4%
 
e1088.0%
 
d1047.7%
 
s967.1%
 
t846.2%
 
b765.6%
 
k705.2%
 
o392.9%
 
w251.8%
 
g251.8%
 
r141.0%
 
p80.6%
 
v60.4%
 
i60.4%
 
l60.4%
 

drive-wheels
Categorical

Distinct count3
Unique (%)1.5%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
fwd
120
rwd
76
4wd
 
9
ValueCountFrequency (%) 
fwd12058.5%
 
rwd7637.1%
 
4wd94.4%
 

Length

Max length3
Median length3
Mean length3
Min length3

Overview of Unicode Properties

Unique unicode characters5
Unique unicode categories (?)2
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
w20533.3%
 
d20533.3%
 
f12019.5%
 
r7612.4%
 
491.5%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter60698.5%
 
Decimal Number91.5%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
w20533.8%
 
d20533.8%
 
f12019.8%
 
r7612.5%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
49100.0%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin60698.5%
 
Common91.5%
 

Most frequent Latin characters

ValueCountFrequency (%) 
w20533.8%
 
d20533.8%
 
f12019.8%
 
r7612.5%
 

Most frequent Common characters

ValueCountFrequency (%) 
49100.0%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII615100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
w20533.3%
 
d20533.3%
 
f12019.5%
 
r7612.4%
 
491.5%
 

engine-location
Categorical

Distinct count2
Unique (%)1.0%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
front
202
rear
 
3
ValueCountFrequency (%) 
front20298.5%
 
rear31.5%
 

Length

Max length5
Median length5
Mean length4.985365854
Min length4

Overview of Unicode Properties

Unique unicode characters7
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
r20820.4%
 
f20219.8%
 
o20219.8%
 
n20219.8%
 
t20219.8%
 
e30.3%
 
a30.3%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter1022100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
r20820.4%
 
f20219.8%
 
o20219.8%
 
n20219.8%
 
t20219.8%
 
e30.3%
 
a30.3%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin1022100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
r20820.4%
 
f20219.8%
 
o20219.8%
 
n20219.8%
 
t20219.8%
 
e30.3%
 
a30.3%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII1022100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
r20820.4%
 
f20219.8%
 
o20219.8%
 
n20219.8%
 
t20219.8%
 
e30.3%
 
a30.3%
 

wheel-base
Real number (ℝ≥0)

Distinct count53
Unique (%)25.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean98.75658536585367
Minimum86.6
Maximum120.9
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum86.6
5-th percentile93.02
Q194.5
median97
Q3102.4
95-th percentile110
Maximum120.9
Range34.3
Interquartile range (IQR)7.9

Descriptive statistics

Standard deviation6.021775685
Coefficient of variation (CV)0.06097594062
Kurtosis1.017038946
Mean98.75658537
Median Absolute Deviation (MAD)2.7
Skewness1.050213776
Sum20245.1
Variance36.2617824
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
94.52110.2%
 
93.7209.8%
 
95.7136.3%
 
96.583.9%
 
98.473.4%
 
97.373.4%
 
96.362.9%
 
107.962.9%
 
98.862.9%
 
99.162.9%
 
104.362.9%
 
100.462.9%
 
93.152.4%
 
97.252.4%
 
102.452.4%
 
109.152.4%
 
95.952.4%
 
101.242.0%
 
9742.0%
 
114.242.0%
 
95.342.0%
 
105.831.5%
 
103.531.5%
 
11031.5%
 
89.531.5%
 
Other values (28)4019.5%
 
ValueCountFrequency (%) 
86.621.0%
 
88.410.5%
 
88.621.0%
 
89.531.5%
 
91.321.0%
 
9310.5%
 
93.152.4%
 
93.310.5%
 
93.7209.8%
 
94.310.5%
 
ValueCountFrequency (%) 
120.910.5%
 
115.621.0%
 
114.242.0%
 
11321.0%
 
11210.5%
 
11031.5%
 
109.152.4%
 
10810.5%
 
107.962.9%
 
106.710.5%
 

length
Real number (ℝ≥0)

Distinct count75
Unique (%)36.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean174.04926829268288
Minimum141.1
Maximum208.1
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum141.1
5-th percentile157.14
Q1166.3
median173.2
Q3183.1
95-th percentile196.36
Maximum208.1
Range67
Interquartile range (IQR)16.8

Descriptive statistics

Standard deviation12.33728853
Coefficient of variation (CV)0.0708838862
Kurtosis-0.08289485345
Mean174.0492683
Median Absolute Deviation (MAD)6.9
Skewness0.1559537713
Sum35680.1
Variance152.2086882
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
157.3157.3%
 
188.8115.4%
 
166.373.4%
 
171.773.4%
 
186.773.4%
 
165.362.9%
 
177.862.9%
 
176.262.9%
 
186.662.9%
 
176.852.4%
 
17252.4%
 
175.652.4%
 
173.252.4%
 
172.442.0%
 
16942.0%
 
168.742.0%
 
198.942.0%
 
168.942.0%
 
192.731.5%
 
158.731.5%
 
155.931.5%
 
170.731.5%
 
169.731.5%
 
159.131.5%
 
15031.5%
 
Other values (50)7335.6%
 
ValueCountFrequency (%) 
141.110.5%
 
144.621.0%
 
15031.5%
 
155.931.5%
 
156.910.5%
 
157.110.5%
 
157.3157.3%
 
157.910.5%
 
158.731.5%
 
158.810.5%
 
ValueCountFrequency (%) 
208.110.5%
 
202.621.0%
 
199.621.0%
 
199.210.5%
 
198.942.0%
 
19710.5%
 
193.810.5%
 
192.731.5%
 
191.710.5%
 
190.921.0%
 

width
Real number (ℝ≥0)

Distinct count44
Unique (%)21.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean65.90780487804878
Minimum60.3
Maximum72.3
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum60.3
5-th percentile63.6
Q164.1
median65.5
Q366.9
95-th percentile70.46
Maximum72.3
Range12
Interquartile range (IQR)2.8

Descriptive statistics

Standard deviation2.145203853
Coefficient of variation (CV)0.03254855562
Kurtosis0.7027642441
Mean65.90780488
Median Absolute Deviation (MAD)1.4
Skewness0.9040034988
Sum13511.1
Variance4.60189957
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
63.82411.7%
 
66.52311.2%
 
65.4157.3%
 
63.6115.4%
 
64.4104.9%
 
68.4104.9%
 
6494.4%
 
65.583.9%
 
65.273.4%
 
66.362.9%
 
64.262.9%
 
67.262.9%
 
65.662.9%
 
67.952.4%
 
66.952.4%
 
68.942.0%
 
64.842.0%
 
65.742.0%
 
6531.5%
 
63.931.5%
 
71.431.5%
 
71.731.5%
 
70.331.5%
 
64.621.0%
 
64.121.0%
 
Other values (19)2311.2%
 
ValueCountFrequency (%) 
60.310.5%
 
61.810.5%
 
62.510.5%
 
63.410.5%
 
63.6115.4%
 
63.82411.7%
 
63.931.5%
 
6494.4%
 
64.121.0%
 
64.262.9%
 
ValueCountFrequency (%) 
72.310.5%
 
7210.5%
 
71.731.5%
 
71.431.5%
 
70.910.5%
 
70.610.5%
 
70.510.5%
 
70.331.5%
 
69.621.0%
 
68.942.0%
 

height
Real number (ℝ≥0)

Distinct count49
Unique (%)23.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean53.72487804878049
Minimum47.8
Maximum59.8
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum47.8
5-th percentile49.7
Q152
median54.1
Q355.5
95-th percentile57.5
Maximum59.8
Range12
Interquartile range (IQR)3.5

Descriptive statistics

Standard deviation2.44352197
Coefficient of variation (CV)0.04548213153
Kurtosis-0.4438123651
Mean53.72487805
Median Absolute Deviation (MAD)1.6
Skewness0.06312273247
Sum11013.6
Variance5.970799617
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
50.8146.8%
 
52125.9%
 
55.7125.9%
 
54.5104.9%
 
54.1104.9%
 
55.594.4%
 
56.783.9%
 
54.383.9%
 
51.673.4%
 
56.173.4%
 
52.673.4%
 
50.262.9%
 
5362.9%
 
54.962.9%
 
52.862.9%
 
53.752.4%
 
55.152.4%
 
50.652.4%
 
53.342.0%
 
58.742.0%
 
49.642.0%
 
57.531.5%
 
53.531.5%
 
49.731.5%
 
59.131.5%
 
Other values (24)3818.5%
 
ValueCountFrequency (%) 
47.810.5%
 
48.821.0%
 
49.421.0%
 
49.642.0%
 
49.731.5%
 
50.262.9%
 
50.521.0%
 
50.652.4%
 
50.8146.8%
 
5110.5%
 
ValueCountFrequency (%) 
59.821.0%
 
59.131.5%
 
58.742.0%
 
58.310.5%
 
57.531.5%
 
56.783.9%
 
56.521.0%
 
56.321.0%
 
56.231.5%
 
56.173.4%
 

curb-weight
Real number (ℝ≥0)

Distinct count171
Unique (%)83.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2555.5658536585365
Minimum1488
Maximum4066
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum1488
5-th percentile1901
Q12145
median2414
Q32935
95-th percentile3503
Maximum4066
Range2578
Interquartile range (IQR)790

Descriptive statistics

Standard deviation520.6802035
Coefficient of variation (CV)0.2037436064
Kurtosis-0.0428537661
Mean2555.565854
Median Absolute Deviation (MAD)386
Skewness0.6813981891
Sum523891
Variance271107.8743
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
238542.0%
 
198931.5%
 
191831.5%
 
227531.5%
 
323021.0%
 
241021.0%
 
325221.0%
 
233721.0%
 
240321.0%
 
241421.0%
 
214521.0%
 
313921.0%
 
239521.0%
 
187621.0%
 
187421.0%
 
212821.0%
 
196721.0%
 
238021.0%
 
406621.0%
 
275621.0%
 
257921.0%
 
307521.0%
 
253521.0%
 
254821.0%
 
190921.0%
 
Other values (146)15073.2%
 
ValueCountFrequency (%) 
148810.5%
 
171310.5%
 
181910.5%
 
183710.5%
 
187421.0%
 
187621.0%
 
188910.5%
 
189010.5%
 
190010.5%
 
190510.5%
 
ValueCountFrequency (%) 
406621.0%
 
395010.5%
 
390010.5%
 
377010.5%
 
375010.5%
 
374010.5%
 
371510.5%
 
368510.5%
 
351510.5%
 
350510.5%
 

engine-type
Categorical

Distinct count7
Unique (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
ohc
148
ohcf
 
15
ohcv
 
13
l
 
12
dohc
 
12
Other values (2)
 
5
ValueCountFrequency (%) 
ohc14872.2%
 
ohcf157.3%
 
ohcv136.3%
 
l125.9%
 
dohc125.9%
 
rotor42.0%
 
dohcv10.5%
 

Length

Max length5
Median length3
Mean length3.126829268
Min length1

Overview of Unicode Properties

Unique unicode characters9
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
o19730.7%
 
h18929.5%
 
c18929.5%
 
f152.3%
 
v142.2%
 
d132.0%
 
l121.9%
 
r81.2%
 
t40.6%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter641100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
o19730.7%
 
h18929.5%
 
c18929.5%
 
f152.3%
 
v142.2%
 
d132.0%
 
l121.9%
 
r81.2%
 
t40.6%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin641100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
o19730.7%
 
h18929.5%
 
c18929.5%
 
f152.3%
 
v142.2%
 
d132.0%
 
l121.9%
 
r81.2%
 
t40.6%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII641100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
o19730.7%
 
h18929.5%
 
c18929.5%
 
f152.3%
 
v142.2%
 
d132.0%
 
l121.9%
 
r81.2%
 
t40.6%
 

num-of-cylinders
Categorical

Distinct count7
Unique (%)3.4%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
four
159
six
 
24
five
 
11
eight
 
5
two
 
4
Other values (2)
 
2
ValueCountFrequency (%) 
four15977.6%
 
six2411.7%
 
five115.4%
 
eight52.4%
 
two42.0%
 
three10.5%
 
twelve10.5%
 

Length

Max length6
Median length4
Mean length3.902439024
Min length3

Overview of Unicode Properties

Unique unicode characters14
Unique unicode categories (?)1
Unique unicode scripts (?)1
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
f17021.2%
 
o16320.4%
 
r16020.0%
 
u15919.9%
 
i405.0%
 
s243.0%
 
x243.0%
 
e202.5%
 
v121.5%
 
t111.4%
 
h60.8%
 
w50.6%
 
g50.6%
 
l10.1%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter800100.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
f17021.2%
 
o16320.4%
 
r16020.0%
 
u15919.9%
 
i405.0%
 
s243.0%
 
x243.0%
 
e202.5%
 
v121.5%
 
t111.4%
 
h60.8%
 
w50.6%
 
g50.6%
 
l10.1%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin800100.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
f17021.2%
 
o16320.4%
 
r16020.0%
 
u15919.9%
 
i405.0%
 
s243.0%
 
x243.0%
 
e202.5%
 
v121.5%
 
t111.4%
 
h60.8%
 
w50.6%
 
g50.6%
 
l10.1%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII800100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
f17021.2%
 
o16320.4%
 
r16020.0%
 
u15919.9%
 
i405.0%
 
s243.0%
 
x243.0%
 
e202.5%
 
v121.5%
 
t111.4%
 
h60.8%
 
w50.6%
 
g50.6%
 
l10.1%
 

engine-size
Real number (ℝ≥0)

Distinct count44
Unique (%)21.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean126.90731707317073
Minimum61
Maximum326
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum61
5-th percentile90
Q197
median120
Q3141
95-th percentile201.2
Maximum326
Range265
Interquartile range (IQR)44

Descriptive statistics

Standard deviation41.64269344
Coefficient of variation (CV)0.3281346923
Kurtosis5.305682092
Mean126.9073171
Median Absolute Deviation (MAD)23
Skewness1.947655045
Sum26016
Variance1734.113917
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
122157.3%
 
92157.3%
 
98146.8%
 
97146.8%
 
108136.3%
 
90125.9%
 
110125.9%
 
10983.9%
 
12073.4%
 
14173.4%
 
15262.9%
 
12162.9%
 
14662.9%
 
18162.9%
 
9152.4%
 
15652.4%
 
13652.4%
 
13042.0%
 
18342.0%
 
19431.5%
 
16431.5%
 
17131.5%
 
20931.5%
 
7031.5%
 
23421.0%
 
Other values (19)2411.7%
 
ValueCountFrequency (%) 
6110.5%
 
7031.5%
 
7910.5%
 
8010.5%
 
90125.9%
 
9152.4%
 
92157.3%
 
97146.8%
 
98146.8%
 
10310.5%
 
ValueCountFrequency (%) 
32610.5%
 
30810.5%
 
30410.5%
 
25821.0%
 
23421.0%
 
20931.5%
 
20310.5%
 
19431.5%
 
18342.0%
 
18162.9%
 

fuel-system
Categorical

HIGH CORRELATION

Distinct count8
Unique (%)3.9%
Missing0
Missing (%)0.0%
Memory size1.7 KiB
mpfi
94
2bbl
66
idi
20
1bbl
 
11
spdi
 
9
Other values (3)
 
5
ValueCountFrequency (%) 
mpfi9445.9%
 
2bbl6632.2%
 
idi209.8%
 
1bbl115.4%
 
spdi94.4%
 
4bbl31.5%
 
spfi10.5%
 
mfi10.5%
 

Length

Max length4
Median length4
Mean length3.897560976
Min length3

Overview of Unicode Properties

Unique unicode characters11
Unique unicode categories (?)2
Unique unicode scripts (?)2
Unique unicode blocks (?)1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Most occurring characters

ValueCountFrequency (%) 
b16020.0%
 
i14518.1%
 
p10413.0%
 
f9612.0%
 
m9511.9%
 
l8010.0%
 
2668.3%
 
d293.6%
 
1111.4%
 
s101.3%
 
430.4%
 

Most occurring categories

ValueCountFrequency (%) 
Lowercase Letter71990.0%
 
Decimal Number8010.0%
 

Most frequent Lowercase Letter characters

ValueCountFrequency (%) 
b16022.3%
 
i14520.2%
 
p10414.5%
 
f9613.4%
 
m9513.2%
 
l8011.1%
 
d294.0%
 
s101.4%
 

Most frequent Decimal Number characters

ValueCountFrequency (%) 
26682.5%
 
11113.8%
 
433.8%
 

Most occurring scripts

ValueCountFrequency (%) 
Latin71990.0%
 
Common8010.0%
 

Most frequent Latin characters

ValueCountFrequency (%) 
b16022.3%
 
i14520.2%
 
p10414.5%
 
f9613.4%
 
m9513.2%
 
l8011.1%
 
d294.0%
 
s101.4%
 

Most frequent Common characters

ValueCountFrequency (%) 
26682.5%
 
11113.8%
 
433.8%
 

Most occurring blocks

ValueCountFrequency (%) 
ASCII799100.0%
 

Most frequent ASCII characters

ValueCountFrequency (%) 
b16020.0%
 
i14518.1%
 
p10413.0%
 
f9612.0%
 
m9511.9%
 
l8010.0%
 
2668.3%
 
d293.6%
 
1111.4%
 
s101.3%
 
430.4%
 

bore
Real number (ℝ≥0)

Distinct count38
Unique (%)18.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.329365853658536
Minimum2.54
Maximum3.94
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum2.54
5-th percentile2.97
Q13.15
median3.31
Q33.58
95-th percentile3.78
Maximum3.94
Range1.4
Interquartile range (IQR)0.43

Descriptive statistics

Standard deviation0.2708575485
Coefficient of variation (CV)0.08135409576
Kurtosis-0.7853719335
Mean3.329365854
Median Absolute Deviation (MAD)0.26
Skewness0.02451054676
Sum682.52
Variance0.07336381157
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3.622311.2%
 
3.19209.8%
 
3.15157.3%
 
3.31125.9%
 
2.97125.9%
 
3.03125.9%
 
3.4694.4%
 
3.7883.9%
 
3.4383.9%
 
2.9173.4%
 
3.2773.4%
 
3.0562.9%
 
3.5862.9%
 
3.3962.9%
 
3.5462.9%
 
3.0152.4%
 
3.752.4%
 
3.3542.0%
 
3.7431.5%
 
3.5931.5%
 
3.1731.5%
 
3.2421.0%
 
3.1321.0%
 
3.6321.0%
 
3.821.0%
 
Other values (13)178.3%
 
ValueCountFrequency (%) 
2.5410.5%
 
2.6810.5%
 
2.9173.4%
 
2.9210.5%
 
2.97125.9%
 
2.9910.5%
 
3.0152.4%
 
3.03125.9%
 
3.0562.9%
 
3.0810.5%
 
ValueCountFrequency (%) 
3.9421.0%
 
3.821.0%
 
3.7883.9%
 
3.7610.5%
 
3.7431.5%
 
3.752.4%
 
3.6321.0%
 
3.622311.2%
 
3.6110.5%
 
3.610.5%
 

stroke
Real number (ℝ≥0)

Distinct count36
Unique (%)17.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.2560975609756095
Minimum2.07
Maximum4.17
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum2.07
5-th percentile2.64
Q13.11
median3.29
Q33.41
95-th percentile3.64
Maximum4.17
Range2.1
Interquartile range (IQR)0.3

Descriptive statistics

Standard deviation0.3136336539
Coefficient of variation (CV)0.09632194615
Kurtosis2.17811411
Mean3.256097561
Median Absolute Deviation (MAD)0.14
Skewness-0.6960331002
Sum667.5
Variance0.09836606887
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
3.4209.8%
 
3.03146.8%
 
3.15146.8%
 
3.23146.8%
 
3.39136.3%
 
3.29136.3%
 
2.64115.4%
 
3.3594.4%
 
3.4683.9%
 
3.4162.9%
 
3.1962.9%
 
3.0762.9%
 
3.1162.9%
 
3.5862.9%
 
3.2762.9%
 
3.562.9%
 
3.5252.4%
 
3.6452.4%
 
3.8642.0%
 
3.4742.0%
 
3.5442.0%
 
3.931.5%
 
2.931.5%
 
3.121.0%
 
3.0821.0%
 
Other values (11)157.3%
 
ValueCountFrequency (%) 
2.0710.5%
 
2.1921.0%
 
2.3610.5%
 
2.64115.4%
 
2.6821.0%
 
2.7610.5%
 
2.821.0%
 
2.8710.5%
 
2.931.5%
 
3.03146.8%
 
ValueCountFrequency (%) 
4.1721.0%
 
3.931.5%
 
3.8642.0%
 
3.6452.4%
 
3.5862.9%
 
3.5442.0%
 
3.5252.4%
 
3.562.9%
 
3.4742.0%
 
3.4683.9%
 

compression-ratio
Real number (ℝ≥0)

Distinct count32
Unique (%)15.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.142536585365855
Minimum7.0
Maximum23.0
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum7
5-th percentile7.5
Q18.6
median9
Q39.4
95-th percentile21.82
Maximum23
Range16
Interquartile range (IQR)0.8

Descriptive statistics

Standard deviation3.972040322
Coefficient of variation (CV)0.3916219861
Kurtosis5.233054348
Mean10.14253659
Median Absolute Deviation (MAD)0.4
Skewness2.610862458
Sum2079.22
Variance15.77710432
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
94622.4%
 
9.42612.7%
 
8.5146.8%
 
9.5136.3%
 
9.3115.4%
 
8.794.4%
 
9.283.9%
 
883.9%
 
773.4%
 
2152.4%
 
7.552.4%
 
9.652.4%
 
2352.4%
 
8.452.4%
 
8.652.4%
 
21.542.0%
 
7.642.0%
 
1031.5%
 
22.531.5%
 
8.331.5%
 
8.831.5%
 
7.721.0%
 
8.121.0%
 
9.3110.5%
 
21.910.5%
 
Other values (7)73.4%
 
ValueCountFrequency (%) 
773.4%
 
7.552.4%
 
7.642.0%
 
7.721.0%
 
7.810.5%
 
883.9%
 
8.121.0%
 
8.331.5%
 
8.452.4%
 
8.5146.8%
 
ValueCountFrequency (%) 
2352.4%
 
22.710.5%
 
22.531.5%
 
2210.5%
 
21.910.5%
 
21.542.0%
 
2152.4%
 
11.510.5%
 
10.110.5%
 
1031.5%
 

horsepower
Real number (ℝ≥0)

Distinct count59
Unique (%)28.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean104.16585365853659
Minimum48
Maximum288
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum48
5-th percentile62
Q170
median95
Q3116
95-th percentile180.8
Maximum288
Range240
Interquartile range (IQR)46

Descriptive statistics

Standard deviation39.52973322
Coefficient of variation (CV)0.3794884008
Kurtosis2.685167834
Mean104.1658537
Median Absolute Deviation (MAD)25
Skewness1.403441029
Sum21354
Variance1562.599809
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
68199.3%
 
70115.4%
 
69104.9%
 
9594.4%
 
11694.4%
 
11083.9%
 
11462.9%
 
6262.9%
 
10162.9%
 
16062.9%
 
8862.9%
 
14552.4%
 
10252.4%
 
8452.4%
 
9752.4%
 
7652.4%
 
8252.4%
 
12342.0%
 
8642.0%
 
9242.0%
 
11142.0%
 
9031.5%
 
8531.5%
 
7331.5%
 
20731.5%
 
Other values (34)5124.9%
 
ValueCountFrequency (%) 
4810.5%
 
5221.0%
 
5510.5%
 
5621.0%
 
5810.5%
 
6010.5%
 
6262.9%
 
6410.5%
 
68199.3%
 
69104.9%
 
ValueCountFrequency (%) 
28810.5%
 
26210.5%
 
20731.5%
 
20010.5%
 
18421.0%
 
18231.5%
 
17621.0%
 
17510.5%
 
16221.0%
 
16121.0%
 

peak-rpm
Real number (ℝ≥0)

Distinct count23
Unique (%)11.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5126.09756097561
Minimum4150
Maximum6600
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum4150
5-th percentile4250
Q14800
median5200
Q35500
95-th percentile5980
Maximum6600
Range2450
Interquartile range (IQR)700

Descriptive statistics

Standard deviation477.0357719
Coefficient of variation (CV)0.09306022101
Kurtosis0.08484457282
Mean5126.097561
Median Absolute Deviation (MAD)300
Skewness0.06897885863
Sum1050850
Variance227563.1277
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
55003718.0%
 
48003617.6%
 
50002713.2%
 
52002512.2%
 
5400136.3%
 
600094.4%
 
525073.4%
 
580073.4%
 
450073.4%
 
415052.4%
 
420052.4%
 
435042.0%
 
475042.0%
 
510031.5%
 
425031.5%
 
440031.5%
 
590031.5%
 
660021.0%
 
575010.5%
 
530010.5%
 
465010.5%
 
490010.5%
 
560010.5%
 
ValueCountFrequency (%) 
415052.4%
 
420052.4%
 
425031.5%
 
435042.0%
 
440031.5%
 
450073.4%
 
465010.5%
 
475042.0%
 
48003617.6%
 
490010.5%
 
ValueCountFrequency (%) 
660021.0%
 
600094.4%
 
590031.5%
 
580073.4%
 
575010.5%
 
560010.5%
 
55003718.0%
 
5400136.3%
 
530010.5%
 
525073.4%
 

city-mpg
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count29
Unique (%)14.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25.21951219512195
Minimum13
Maximum49
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum13
5-th percentile16
Q119
median24
Q330
95-th percentile37
Maximum49
Range36
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.542141653
Coefficient of variation (CV)0.2594079379
Kurtosis0.5786483405
Mean25.2195122
Median Absolute Deviation (MAD)5
Skewness0.6637040288
Sum5170
Variance42.79961741
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
312813.7%
 
192713.2%
 
242210.7%
 
27146.8%
 
17136.3%
 
26125.9%
 
23125.9%
 
2183.9%
 
3083.9%
 
2583.9%
 
3873.4%
 
2873.4%
 
3762.9%
 
1662.9%
 
2242.0%
 
1531.5%
 
1831.5%
 
2931.5%
 
2031.5%
 
1421.0%
 
4910.5%
 
4710.5%
 
3210.5%
 
3310.5%
 
3410.5%
 
Other values (4)42.0%
 
ValueCountFrequency (%) 
1310.5%
 
1421.0%
 
1531.5%
 
1662.9%
 
17136.3%
 
1831.5%
 
192713.2%
 
2031.5%
 
2183.9%
 
2242.0%
 
ValueCountFrequency (%) 
4910.5%
 
4710.5%
 
4510.5%
 
3873.4%
 
3762.9%
 
3610.5%
 
3510.5%
 
3410.5%
 
3310.5%
 
3210.5%
 

highway-mpg
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count30
Unique (%)14.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean30.75121951219512
Minimum16
Maximum54
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum16
5-th percentile22
Q125
median30
Q334
95-th percentile42.8
Maximum54
Range38
Interquartile range (IQR)9

Descriptive statistics

Standard deviation6.886443131
Coefficient of variation (CV)0.2239404889
Kurtosis0.4400703815
Mean30.75121951
Median Absolute Deviation (MAD)5
Skewness0.5399971879
Sum6304
Variance47.423099
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
25199.3%
 
24178.3%
 
38178.3%
 
30167.8%
 
32167.8%
 
34146.8%
 
37136.3%
 
28136.3%
 
29104.9%
 
3394.4%
 
3183.9%
 
2283.9%
 
2373.4%
 
2752.4%
 
4342.0%
 
4131.5%
 
4231.5%
 
2631.5%
 
2021.0%
 
1921.0%
 
1821.0%
 
1621.0%
 
3621.0%
 
3921.0%
 
4621.0%
 
Other values (5)62.9%
 
ValueCountFrequency (%) 
1621.0%
 
1710.5%
 
1821.0%
 
1921.0%
 
2021.0%
 
2283.9%
 
2373.4%
 
24178.3%
 
25199.3%
 
2631.5%
 
ValueCountFrequency (%) 
5410.5%
 
5310.5%
 
5010.5%
 
4721.0%
 
4621.0%
 
4342.0%
 
4231.5%
 
4131.5%
 
3921.0%
 
38178.3%
 

price
Real number (ℝ≥0)

Distinct count186
Unique (%)90.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean13150.307317073171
Minimum5118
Maximum45400
Zeros0
Zeros (%)0.0%
Memory size1.7 KiB

Quantile statistics

Minimum5118
5-th percentile6197
Q17788
median10295
Q316500
95-th percentile32472.4
Maximum45400
Range40282
Interquartile range (IQR)8712

Descriptive statistics

Standard deviation7879.121326
Coefficient of variation (CV)0.599158722
Kurtosis3.374863565
Mean13150.30732
Median Absolute Deviation (MAD)3204
Skewness1.840979309
Sum2695813
Variance62080552.87
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1029552.4%
 
729521.0%
 
884521.0%
 
760921.0%
 
557221.0%
 
849521.0%
 
892121.0%
 
622921.0%
 
669221.0%
 
789821.0%
 
1815021.0%
 
795721.0%
 
777521.0%
 
927921.0%
 
1650021.0%
 
1349921.0%
 
1185010.5%
 
1294010.5%
 
647910.5%
 
698910.5%
 
1104810.5%
 
1719910.5%
 
519510.5%
 
1154910.5%
 
2817610.5%
 
Other values (161)16178.5%
 
ValueCountFrequency (%) 
511810.5%
 
515110.5%
 
519510.5%
 
534810.5%
 
538910.5%
 
539910.5%
 
549910.5%
 
557221.0%
 
609510.5%
 
618910.5%
 
ValueCountFrequency (%) 
4540010.5%
 
4131510.5%
 
4096010.5%
 
3702810.5%
 
3688010.5%
 
3600010.5%
 
3555010.5%
 
3505610.5%
 
3418410.5%
 
3402810.5%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

symbolingnormalized-lossesmakefuel-typeaspirationnum-of-doorsbody-styledrive-wheelsengine-locationwheel-baselengthwidthheightcurb-weightengine-typenum-of-cylindersengine-sizefuel-systemborestrokecompression-ratiohorsepowerpeak-rpmcity-mpghighway-mpgprice
03115alfa-romerogasstdtwoconvertiblerwdfront88.6168.864.148.82548dohcfour130mpfi3.472.689.01115000212713495
13115alfa-romerogasstdtwoconvertiblerwdfront88.6168.864.148.82548dohcfour130mpfi3.472.689.01115000212716500
21115alfa-romerogasstdtwohatchbackrwdfront94.5171.265.552.42823ohcvsix152mpfi2.683.479.01545000192616500
32164audigasstdfoursedanfwdfront99.8176.666.254.32337ohcfour109mpfi3.193.4010.01025500243013950
42164audigasstdfoursedan4wdfront99.4176.666.454.32824ohcfive136mpfi3.193.408.01155500182217450
52115audigasstdtwosedanfwdfront99.8177.366.353.12507ohcfive136mpfi3.193.408.51105500192515250
61158audigasstdfoursedanfwdfront105.8192.771.455.72844ohcfive136mpfi3.193.408.51105500192517710
71115audigasstdfourwagonfwdfront105.8192.771.455.72954ohcfive136mpfi3.193.408.51105500192518920
81158audigasturbofoursedanfwdfront105.8192.771.455.93086ohcfive131mpfi3.133.408.31405500172023875
90115audigasturbotwohatchback4wdfront99.5178.267.952.03053ohcfive131mpfi3.133.407.01605500162210295

Last rows

symbolingnormalized-lossesmakefuel-typeaspirationnum-of-doorsbody-styledrive-wheelsengine-locationwheel-baselengthwidthheightcurb-weightengine-typenum-of-cylindersengine-sizefuel-systemborestrokecompression-ratiohorsepowerpeak-rpmcity-mpghighway-mpgprice
195-174volvogasstdfourwagonrwdfront104.3188.867.257.53034ohcfour141mpfi3.783.159.51145400232813415
196-2103volvogasstdfoursedanrwdfront104.3188.867.256.22935ohcfour141mpfi3.783.159.51145400242815985
197-174volvogasstdfourwagonrwdfront104.3188.867.257.53042ohcfour141mpfi3.783.159.51145400242816515
198-2103volvogasturbofoursedanrwdfront104.3188.867.256.23045ohcfour130mpfi3.623.157.51625100172218420
199-174volvogasturbofourwagonrwdfront104.3188.867.257.53157ohcfour130mpfi3.623.157.51625100172218950
200-195volvogasstdfoursedanrwdfront109.1188.868.955.52952ohcfour141mpfi3.783.159.51145400232816845
201-195volvogasturbofoursedanrwdfront109.1188.868.855.53049ohcfour141mpfi3.783.158.71605300192519045
202-195volvogasstdfoursedanrwdfront109.1188.868.955.53012ohcvsix173mpfi3.582.878.81345500182321485
203-195volvodieselturbofoursedanrwdfront109.1188.868.955.53217ohcsix145idi3.013.4023.01064800262722470
204-195volvogasturbofoursedanrwdfront109.1188.868.955.53062ohcfour141mpfi3.783.159.51145400192522625